Cluster-based data filtering for manufacturing big data systems
نویسندگان
چکیده
A manufacturing system collects big and heterogeneous data for tasks such as product quality modeling data-driven decision-making. However, the size of grows, timely effective utilization becomes challenging. We propose an unsupervised filtering method to reduce sets with multi-variate continuous variables into informative small sets. Furthermore, determine appropriate proportion be filtered, we a information criterion (FIC) balance tradeoff between filtered preserved. The case study babycare simulation have shown effectiveness proposed method.
منابع مشابه
Item based Collaborative filtering approach for Big Data application
Spurred by Service computing and cloud computing an increasing number of services are emerged in the Internet. As a result, resource -permissible data arise too big to be handled by already established techniques. Clustering along with Item based collaborative filtering has been proposed to reduce online execution time taken for processing services. Technically, these approaches sanction around...
متن کاملBayesian Conditional Density Filtering for Big Data
We propose a Conditional Density Filtering (C-DF) algorithm for efficient online Bayesian inference. C-DF adapts Gibbs sampling to the online setting, sampling from approximations to conditional posterior distributions obtained by tracking of surrogate conditional sufficient statistics as new data arrive. This tracking eliminates the need to store or process the entire data set simultaneously. ...
متن کاملProviding Flexible File-Level Data Filtering for Big Data Analytics
The enormous amount of big data datasets impose the needs for effective data filtering technique to accelerate the analytics process. We propose a Versatile Searchable File System, VSFS, which provides a transparent, flexible and near real-time file-level data filtering service by searching files directly through the file system. Therefore, big data analytics applications can transparently util...
متن کاملBiDAl: Big Data Analyzer for Cluster Traces
Modern data centers that provide Internet-scale services are stadium-size structures housing tens of thousands of heterogeneous devices (server clusters, networking equipment, power and cooling infrastructures) that must operate continuously and reliably. As part of their operation, these devices produce large amounts of data in the form of event and error logs that are essential not only for i...
متن کاملA Fuzzy TOPSIS Approach for Big Data Analytics Platform Selection
Big data sizes are constantly increasing. Big data analytics is where advanced analytic techniques are applied on big data sets. Analytics based on large data samples reveals and leverages business change. The popularity of big data analytics platforms, which are often available as open-source, has not remained unnoticed by big companies. Google uses MapReduce for PageRank and inverted indexes....
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of Quality Technology
سال: 2021
ISSN: ['2575-6230', '0022-4065']
DOI: https://doi.org/10.1080/00224065.2021.1889420